Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
seva | 1424 | 126 | 3 | 42.0000 |
Els | 1293 | 120 | 3 | 40.0000 |
Aquesta | 414 | 29 | 1 | 29.0000 |
seves | 485 | 51 | 2 | 25.5000 |
El | 2871 | 216 | 11 | 19.6364 |
però | 950 | 39 | 2 | 19.5000 |
La | 2795 | 232 | 12 | 19.3333 |
terme | 279 | 16 | 1 | 16.0000 |
Amb | 271 | 14 | 1 | 14.0000 |
Això | 184 | 14 | 1 | 14.0000 |
Les | 797 | 71 | 6 | 11.8333 |
I | 615 | 35 | 3 | 11.6667 |
seu | 1223 | 109 | 10 | 10.9000 |
És | 417 | 21 | 2 | 10.5000 |
Entre | 110 | 9 | 1 | 9.0000 |
seus | 626 | 53 | 6 | 8.8333 |
Si | 379 | 26 | 3 | 8.6667 |
Comissió | 765 | 16 | 2 | 8.0000 |
No | 429 | 22 | 3 | 7.3333 |
Quan | 118 | 7 | 1 | 7.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
crec | 101 | 1 | 9 | 0.1111 |
considerar | 43 | 1 | 7 | 0.1429 |
opció | 50 | 1 | 7 | 0.1429 |
anat | 59 | 1 | 7 | 0.1429 |
milions | 541 | 7 | 46 | 0.1522 |
seminari | 30 | 1 | 6 | 0.1667 |
basat | 60 | 1 | 6 | 0.1667 |
telèfon | 32 | 1 | 6 | 0.1667 |
concepte | 61 | 1 | 6 | 0.1667 |
metodologia | 53 | 1 | 6 | 0.1667 |
educatius | 35 | 1 | 6 | 0.1667 |
seguit | 63 | 1 | 6 | 0.1667 |
caràcter | 82 | 1 | 6 | 0.1667 |
mena | 57 | 1 | 6 | 0.1667 |
simple | 40 | 1 | 6 | 0.1667 |
banda | 307 | 3 | 16 | 0.1875 |
mapa | 49 | 1 | 5 | 0.2000 |
contracte | 41 | 1 | 5 | 0.2000 |
l’Ajuntament | 47 | 1 | 5 | 0.2000 |
greu | 38 | 1 | 5 | 0.2000 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II